Chunking clinical text containing non-canonical language
Free text notes typed by primary care physicians during patient consultations typically contain highly non-canonical language. Shallow syntactic analysis of free text notes can help to reveal valuable information for the study of disease and treatment. We present an exploratory study into chunking such text using off-the-shelf language processing tools and pre-trained statistical models. We evaluate chunking accuracy with respect to part-of-speech tagging quality, choice of chunk representation, and breadth of context features. Our results indicate that narrow context feature windows give the best results, but that chunk representation and minor differences in tagging quality do not have a significant impact on chunking accuracy.
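Chunk representations of the kind compared in this study are typically tag-per-token encodings such as BIO, in which each token is labelled as Beginning a chunk, Inside one, or Outside any chunk. As an illustration only (not the study's actual tooling), the following minimal sketch decodes a BIO tag sequence into phrase spans; the clinical-style tokens are hypothetical:

```python
def chunks_from_bio(tokens, tags):
    """Decode parallel lists of tokens and BIO tags into (phrase, label) pairs."""
    spans, start = [], None
    for i, tag in enumerate(tags):
        # A chunk starts at B-, or at an I- that does not continue the previous chunk.
        starts_chunk = tag.startswith("B-") or (
            tag.startswith("I-")
            and (i == 0 or tags[i - 1] == "O" or tags[i - 1][2:] != tag[2:])
        )
        if starts_chunk:
            if start is not None:
                spans.append((start, i))
            start = i
        elif tag == "O" and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(tags)))
    return [(" ".join(tokens[s:e]), tags[s][2:]) for s, e in spans]

# Hypothetical shorthand-heavy clinical fragment: "pt c/o chest pain"
print(chunks_from_bio(["pt", "c/o", "chest", "pain"],
                      ["B-NP", "B-VP", "B-NP", "I-NP"]))
# → [('pt', 'NP'), ('c/o', 'VP'), ('chest pain', 'NP')]
```

Alternative representations (e.g. IOB1 or IOE) encode the same spans with different boundary conventions, which is why they can be compared while holding the underlying chunks fixed.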
Deciphering clinical text: concept recognition in primary care text notes
Electronic patient records, containing data about the health and care of a patient, are a valuable source of information for longitudinal clinical studies. The General Practice Research Database (GPRD) has collected patient records from UK primary care practices since the late 1980s. These records contain both structured data (in the form of codes and numeric values) and free text notes. While the structured data have been used extensively in clinical studies, there are significant practical obstacles in extracting information from the free text notes. The main obstacles are data access restrictions, due to the presence of sensitive information, and the specific language of medical practitioners, which renders standard language processing tools ineffective.
The aim of this research is to investigate approaches for computer analysis of free text notes. The research involved designing a primary care text corpus (the Harvey Corpus) annotated with syntactic chunks and clinically-relevant semantic entities, developing a statistical chunking model, and devising a novel method for applying machine learning for entity recognition based on chunk annotation. The tools produced would facilitate reliable information extraction from primary care patient records, needed for the development of clinically-related research. The three medical concept types targeted in this thesis could contribute to epidemiological studies by enhancing the detection of co-morbidities, and better analysing the descriptions of patient experiences and treatments.
The main contributions of the research reported in this thesis are: guidelines for chunk and concept annotation of clinical text, an approach to maximising agreement between human annotators, the Harvey Corpus, a method for using a standard part-of-speech tagging model in clinical text chunking, and a novel approach to recognising clinically relevant medical concepts.
Human Evaluation and Correlation with Automatic Metrics in Consultation Note Generation
In recent years, machine learning models have rapidly become better at generating clinical consultation notes; yet, there is little work on how to properly evaluate the generated consultation notes to understand the impact they may have on both the clinician using them and the patient's clinical safety. To address this, we present an extensive human evaluation study of consultation notes where 5 clinicians (i) listen to 57 mock consultations, (ii) write their own notes, (iii) post-edit a number of automatically generated notes, and (iv) extract all the errors, both quantitative and qualitative. We then carry out a correlation study between 18 automatic quality metrics and the human judgements. We find that a simple, character-based Levenshtein distance metric performs on par with, if not better than, common model-based metrics like BertScore. All our findings and annotations are open-sourced.
Comment: To be published in proceedings of ACL 202
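For readers unfamiliar with the metric, character-based Levenshtein distance is the minimum number of single-character insertions, deletions, and substitutions needed to turn one string into another. The following is a minimal sketch of the standard dynamic-programming computation, for illustration only; it is not the paper's actual evaluation code:

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between strings a and b via one-row dynamic programming."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        cur = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute (free if equal)
            ))
        prev = cur
    return prev[-1]

print(levenshtein("kitten", "sitting"))  # → 3
```

Used as a note-quality metric, the raw distance is typically normalised, e.g. `1 - levenshtein(hyp, ref) / max(len(hyp), len(ref))`, so that higher scores mean closer agreement with the reference note.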
The authors would like to thank Rachel Young and Tom Knoll for supporting the team and hiring the evaluators, Vitalii Zhelezniak for his advice on revising the paper, and Kristian Boda for helping to set up the Stanza+Snomed fact-extraction system.